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Abstract — A method is described for (be automated detection 
of microcalcifications in digitized mammograms. The method is 
based on the Lapbeinn scale-space representation of the mam. 
mogram only. First, possible locations of microcalciflcalions are 
identified as local raaslroe in the filtered image on a range of 
scales. For each finding, the size and local contrast is estimated, 
based on (he Laplacian response denoted as the scale-space signa- 
ture. A finding Is marked as a mieroceleiflcotlon if the estimated 
contrast is larger than a predefined threshold which depends 
on the size of the finding. It is shown that the signature has a 
characteristic peak, revealing the corresponding image features. 
This peak can be robustly determined. The basic method is 
significantly Improved by consideration of the statistical variation 
of the estimated contrast, which is the result of the compter noise 
characteristic of the 
with the Nijmegen ' 

rtsp^n^reUiver operating characteristic' (FROC) performance. 
At a rate of one false positive cluster per image the method 
reaches a sensitivity or 084. which is comparable to the best 
results achieved so far. 



I. Introduction 

BREAST cancer is the most common malignant disease 
among women in the USA and Europe. Statistics indicate 
thai in the USA approximately one in ten women will develop 
breast cancer during lifetime (I]. Mammography is widely 
regarded as Ihe most effective method for the early detection 
of breast cancer, since tumors may be visualized by indirect 
signs such as the presence of clustered microcalcifications 
and architectural distortion. The radiological interpretation of 
mammograms, however, is a difficult task, since the mam- 
mographic appearance of normal tissue is highly variable. The 
nature of many abnormal patterns is subtle and their perception 
is often limited by low attenuation contrast. 

In particular, the effectiveness of breast cancer screening, 
where abnormalities have to be detected at an early stage 
in a large number of asymptomatic women, depends on the 
radiologist's ability to detect signs of malignancy. For this 
reason, independent reading of screening mammograms by two 
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radiologists is suggested lo reduce the number of interpretation 
errors. In spile of this, retrospective analysis of previous 
mammograms still shows a substantia) fraction of missed 
cancers. Considerable research has been undertaken in the 
development of automated image analysis methods to assist 
radiologists in the Identification of abnormal lesions. Recent 
studies demonstrate that computer-based prompting systems, 
which indicate suspicious regions in the mammogram, have 
promising potential to reduce the number of lesions missed by 
radiologists |2], (3). 

Much of the current work on computerized detection of 
mammography abnormalities focuses on the recognition of 
clustered microcalcifications [4]-(7]. This might be partly due 
to their clinical importance, about 30-30% of mammograph- 
icaJly detected cancer contains microcalcifications visible in 
the image 18), but is also due to their characteristic image 
features, which can be addressed by various pattern recognition 

are calcium deposits as small as 0.1-0.3 
which can be identified in mammograms as 
tiny areas that are slightly brighter than their surrounding 
tissue. Only clusters of three or more particles within a 
region of approximately I cm 7 are considered as clinically 
suspicious (91. The morphologic appearance or microcal- 
cifications has been emphasized in the last decades [10]. 
Pleomorphic, curvilinear, or branched shapes are usually found 
within malignant lesions, while sharply outlined pearl-like 



mammogram containing a malignant-type microcalcification 
cluster is shown in Figs. 1 and 2. 

Detection algorithms are frequently evaluated with a local 
database, which maXes it difficult to objectively compare 
the performance of the different methods. We are using a 
publicly available database,' ihe Nijmegen database (II], 
which has been widely used by researchers for performance 
measurements. In the following, we briefly discuss previous 
work evaluated with this database. Kegelmeyer and Allmen 
1 12], for example, apply several image segmentation methods 
and image features to identify microcalcifications. The features 
are classified by binary decision trees. The features determined 
for training images are balanced with truth information to grow 
Ihe trees. Pixels of unknown mammograms are then labeled, 
due to the trees yielding a map indicating the probability that a 
pixel belongs to a microcalcification. The performance of their 
approach, however, is rather poor, which they partly explain by 
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microcalcificaiion clusters of the Nijmegen database, with 
2.3 false clusters per image although a different overlap 
criterion is used for the evaluation. The best results for the 
database obtained so far, however, are by Karssemeijer fit]. 
He describes a Bayesian method utilizing a Markov random 
field (MRF) model [17], [18]. We will later focus on this 
approach in more detail. 

The paper is organized as follows. In Section III, a novel 
method for the detection of individual microcalcifications is 
introduced [19], [20], Potential microcalcifications are first 
identified with a multiscate spot detector based on Lapladan 
filter kernels. Then, their size and local image contrast are 
robustly estimated, using the filter response of their center 
pixel. The basic method is further improved by consideration 
of the statistical variation of the extracted features caused by 
image noise. In Section IV, the method is evaluated with the 
Nijmegen database and the improvements ai 



Fig. I. The sue of the region U about 2 cm (200 pixel). 



the use of impure training data. Meersman et ai [ 1 3] use neural 
networks with varying dimensions. Each pixel in a 64 x 64 
region is trained by feeding its squared neighborhood of 9 x 9 
pixels as the input to the network. The performance of the 
network is optimized by means of the mean square error of 
the network output and the correct segmentation. Kimmig el aL 
[14] apply matched filter theory for the detection and area and 
shape features to eliminate various false detections. A wavelet 
approach for the recognition of individual microcalcifications 
is suggested by Strickland and Hahn [IS]. They implemented a 
multiscale matched filler bank by means of a suitable wavelet 
transform, which is superior to the results of Kimmig. A 
method based on adaptive filtering and a Gaussianity test of 
the prediction error is proposed by Gtircan el aL [I6J. For 
the prediction of pixel values, a two-dimensional (2-D) least 
n square adaptive filter is applied. Using experimentally 
' e scheme is able to delect all true 



jer's MRF 

II. Material 

- The Nijmegen database contains 40 digitized mammograms, 
composed of both craniocaudal and mediolatera) oblique views 
recorded from 21 patients. The images are digitized with a 
CCD camera from screen-film at a pixel size of 0.1 mm 
and a 12-bit grayscale linear to the transmitted light during 
digitization. The image matrix is 2048 x 2048 pixels. Each 
mammogram shows one or more microcalcifi cation clusters 
verified by histology. The total number or clusters in the 
database is 10S. The location and size of each cluster is given 
by a truth circle marked by expert radiologists. The database 
also contains look-up tables for rescaling of the image data and 
text files, storing the center and diameter of the truth circles 
in pixel coordinates. 

The evaluation of the performance of computerized detec- 
tion methods is usually addressed by free-response receiver op- 
erating characteristic (FROQ methodology [21], [22]. FROC 
analysis allows us to take the location of abnormal lesions in 
the mammogram into account. Images with multiple clusters 
and multiple computer-generated prompts can therefore be 
used within the evaluation. Based on the ground truth infor- 
mation provided by the database the number of true positive 
(TP) and false positive (FP) clusters are counted according 
to a given level of detection sensitivity, which is controlled 
by a corresponding parameter setting of the algorithm. The 
FROC diagram represents the fraction of TP clusters (TP 
fraction) as a function of the number of the FP clusters 
per image. Each parameter setting results in a corresponding 
operating point in the diagram. Several evaluations are needed 
to describe the performance of the method. The assessment of 
the significance of the results is usually addressed by fitting 
the experimental data to an underlying statistical model of 
observer performance. Although quantitative measures have 
been proposed for FROC analysis to compare the results of 
different detection schemes [22], such measures are rarely used 
in work on automated detection for significance testing. 
FROC results depend on the applied lest, which defines 
ings are regarded as FP or TP 
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clusters. To effectively compare the performance of different 
detection methods, not only (he same set of images must be 
used, but also the same overlap rule. Following Kaissemeijer 
(11], a I cm (100 pixel) diameter circle is inscribed around 
each individual finding. Intersecting disks are merged to a 
single connected region. A true cluster is considered to be 
delected if at least two findings are found in the associated 
truth circle marked by expert radiologists. Regions without 
findings in truth circles are counted as FP clusters if they 
contain more than one finding. The overlap rule also removes 
isolated findings within the I cm circle. Such findings are 
frequently caused by film scratches and emulsion errors. 

III. METHOD 

A. Motivation and Outline 

At a spatial resolution of 0.1 m pixel size, microcalci- 
ficatkms cover only a small number of image pixels and 
appear as small bright circular or slightly elongated spots in 
the mammogram. Therefore, we plan to solve the detection 
problem as follows: 

1) find bright, almost circular spots in the mammogram; 

2) estimate size D and local contrast C or each spot; 

3) mark a spot as a microcakification if C> Cr(D) where 
Or is a given threshold depending on the estimated size 
D of the spot. 

The threshold C T is used to control the sensitivity of the 
detection. For example, if Or is small and also diffuse, faint 
spots in the mammogram are regarded as possible microcal- 
cifications. The threshold C T depends on the size D since 
large solid mlcrocalciftcations absorb more X-ray energy and 
appear brighter in the mammogram. 

Spots of a different size are detected in step I) by a 
method proposed by Blostein and Ahuja (23]. The authors 
show that bright spots correspond to local maxima in Laplacian 
convolved images if the size of the filter kernel is chosen 
appropriately. To reliably detect all potential microcalcifica- 
tions, local maxima must be considered on a range of scales. 
Methods concerning the scale of objects to be detected by 
using convolutions for different kernel sizes are referred to as 
scale-space approaches [24], [25]. 

Further mouvation for the application of Laplacian kernels 
can be found in the context of matched filler theory [26]. A 
matched filter maximizes its response at the location of a well- 
defined known type of pattern in a noisy image. Detection is 
accomplished by sampling the output of the filter for values 
exceeding a chosen threshold. Strickland and Hahn [15] show 
that the high-frequency noise of mammograms can be modeled 
by a separable Markov process and that the average gray level 
profile of microcalcifications is well described by a circularly- 
symmetric Gaussian function. With these assumptions, (he 
resulting matched filter is a Laplacian kernel. 

For the estimation of the size in step 2) we initially applied 
standard methods, such as region growing techniques. These 
algorithms, however, depended on many implicit parameters 
and were not very stable since assumptions on the local 
contrast of the marked spots had to be made. Due to these 
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Fig. 3. Cms section of Ike 2-D Laplacian Alter with the zx plane for scale 
h a 4 and the corresponding view on utt xy plane. The circle indicates the 
intersection of the lobe with the plant. 



difficulties, we developed a method based on a mode) of a 
microcalcificntion and its properties in the scale-space. We 
will show that the Laplacian response of a spot, its scale- 
space signature, reveals the underlying parameters of the 
corresponding model. Finally, in step 3), we choose for each 
size D a large number of different threshold values to evaluate 
the FROC performance of the method. 

8. Laplacian Scale-Space 

The 2-D Laplacian-of-Gaussian (LoG) is defined by the 
application or the Laplacian operator A to the Gaussian 

where a > 0 is the standard deviation and x, y 6 R. Addi- 
tionally, a normalization factor -n(v) is introduced yielding 

LoG.(*.») = - n(c)(A G v {x,y)) 

The continuous parameter a is denoted as the scale of the filter. 
An important question is, for which values or a the image has 
to be convolved. Since images are given as a matrix or pixels, 
it is straightforward to measure the scale a as a number h of 
pixels. As depicted in Fig. 3, the scale or the filter is defined 
as ihe diameter of the central lobe, the positive part of the 
function above the xy plane, measured in pixels. The lobe 
intersects ihe xy plane in a circle with diameter 2\/2<r, thus, 
for a given scale A, the corresponding value or o is 

We define the scale-space representation of an image / as the 
convolution of Ihe Laplacian filter and the image 

[LoG* • /)(*,»/) (4) 

for the scales h «= hm&x- 
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fig. 4. Examplei ofnorrrmiicd Laplacian filler kernela for $eol«(«) h a S. 



The volume of the central lobe, calculated to 2/ea 7 , depends 
on c, which is inconvenient if Laplacian responses on different 
scales are directly compared to each other. Therefore, the filter 
is normalized with respect to the volume of the lobe. Hence, 
with (3) it follows 

(5) 

Fig. 4 shows examples of normalized Laplacian filler kernels. 

The Laplacian must be carefully sampled, since only a few 
values approximate the function for small scales. Following 
Neycenssac [27J, the size of the discrete filter kernel is 
chosen to be Zh. Even kernel sizes are increased by one. 
The fundamental properties of the Laplacian. the integral with 
respect to the xy plane is zero and the integral with respect to 
the central lobe is one due to the normalization, should also 
hold for discrete kernels. The constraints require an additional 
factor k x > 0 and a relaxation parameter k 3 > 1 in (2) 

i^(*,t,)=*i«(a)^(i-**^) 

. e - ( x> +y W. (6) 

The values of *i and ki are determined such that both 
constraints hold for the discrete kernels. For h = 1 the values 
are it, = 3.461 and *, = 3.537. for h = 2 the values 
are k x = 1.156 and kj = 1.002, and for h > 3 we have 
fc t = 1.000 and kj = 1.000, respectively. Because of the large 
relaxation values, scale A = 1 is not considered. Moreover, the 




Fig 3. Artificial image cmuitung of briglq (pon of different toe and local 
carina. At well, Gauttion noUe with tere mean and fixed ctenderd deviaiion 
it added. In die left row the Mack pixch mark the teauon pf teal maxima of 
■be Laplacian Altered im*£e for acalea (a) A = 2. (b) ft = 5, and (c) A = 10. 
T*ec<! - 



filter kernels are separated in one-dimensional (1-D) kernels 
to reduce the computational complexity of the convolution. 



C. Spot Detection 

Possible microcalcifications correspond to local maxima in 
the Laplacian filtered image. A pixel is considered as a local 
maximum if its value is the largest in a 3 x 3 neighborhood. 
Due to the smoothing properties of the Gaussian, a 3 x 3 
neighborhood is sufficient to robustly identify local extrema 
for large filter kernels, as well. The local max jmaare deno ted 
as candidate pixels in the following. 

To understand how candidate pixels are generated by the 
Laplacian, an artificial image is investigated. The image con- 
sists of several bright spots of different sizes and local contrast, 
as shown in Fig. 5, left row. Moreover, Gaussian noise with 
zero mean and constant standard deviation is added. The image 
and the Laplacian kernel are convolved on scales h = 2, 
/i = 5, and h = 10. On scale h = 2 several artifacts are 
generated in the background area, due to the image noise. 
These pixels mark local maxima with very small local contrast. 
For larger scales, the Gaussian blurring becomes apparent: the 



pixels of larger spots are dislocated. On scale h = 10, the 
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Fig. 6. (o) Paten of inanvnogram eOlo consisting of a mkrecaltificaiion 
cluster. (W The candidates are marked by the Laplacian detection applying 
pub tracking as black dots. The tut of the patch it 1.5 ere (100 pixel* 
correspond io J cm). 

influence of image structures in the neighborhood is evident: 
some of the local maxima are shifted from their true center. 

For reliable estimates of the size and local contrast of a 
bright spot, which is explained in the next section in more 
detail, it is important that spots are marked at their true center. 
Therefore, the localization of corresponding candidate pixels 
is improved by coarse-to-fine tracking, which is a well-known 
technique in applications of scale-space filtering (28H30). 
For example, large-scale edges in an image can be identified 
on a coarse scale, but more accurately located on a finer 
scale by following the zero crossings of the Uplecian. Here, 
displacements are identified by tracking the position of the 
local maxima in the scale-space. A path always starts with 
a candidate pixel at the finest scale h = 2 and ends at some 
coarser scale if an appropriate candidate pixel cannot be found 
in the intermediate 3x3 neighborhood. The length of a 
path in the scale-space depends on the local structure of the 
mammogram. Long paths rarely occur and most paths already 
end at scale h = 4. 

The candidate pixels of a path belong to the same spot, 
but their correlation with the true center of the spot may vary. 
Due to the normalization of the filler kernel with respect to the 
central lobe, the magnitude of the response is an indicator of 
the correspondence between the position of the candidate pixel 
and the actual center pixel of the spot The absolute maximum 
of the response along each path indicates the optimal candidate 
representing the spot in the image which is reported for each 
path. Fig. 6 depicts a patch of mammogram cOlo and the 
results or the Laplacian detection with path tracking. 

D. Feature Estimation by Signatures 

The estimation of the local contrast and the size of a spot is 
based on the scale-space representation of the image. First, a 
microcalcificaiion is modeled by a cylinder of a specific height 
C and diameter, D, which corresponds to the local contrast 
and the diameter of the spot in the image, respectively. An 
image /, consisting of a single cylinder with its center at 
M = (0,0). is defined by 

/<*.y)=(c if*W<£ (7) 
[ 0, otherwise. 
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fig. 7. Signature or the cylindrical microcalcincaiion model. The solid curve 
shows the Laplacian response of the model in M for C = 100 and D = 5. 
A change to C = 150 (higher peak) or D = )0 (peak to the right) yield ihe 
dashed carves, respectively. 

It is not necessary to introduce an additional parameter for the 
image background because Laplacian convolution is indepen- 
dent of a constant background. Obviously, for a fixed scale the 
center, M is a local maximum in the Laplacian filtered image. 
The response r cy! (a) in M is analytically calculated by 

rcy) (u) = [LoG, . f](x,y)| Mci(0i0) . (8) 

With (2) and (7) and polar coordinates u = r cos(d>) and 
v ss r ain(tf>), it follows that 

C f°' 2 r 2 " / r 2 \ 

/;"('-£>-"""*■*> 

The remaining integral can be calculated to (£>78)c-< l ' , /*'*> 
and with (3) and (5) it is 

revtW^Ce^e-^'J. (10) 

The graph of r tyl (/i), shown in Fig. 7, is referred to as the 
signature of the model. The signature has a characteristic 
peak which reflects the underlying parameters of the model. 
By differentiation of (10), the coordinate h um of the local 
maximum in the graph evaluates to 

>>,nix = D (II) 



rc,i(/w) = C. (12) 

A candidate pixel in the mammogram has a reasonable 
correlation will) ihe microcalcificaiion model. Therefore, the 
size D' and the local contrast C* of ihe spot are determined 
from the actual signature r*(n) by searching for the position 
'>mu of the characteristic peak. With (1 1) and (12) it follows 
D* s h mM and C = r*(nnuut). A visual inspection shows 
that candidate pixels in the mammogram have a well-defined 
peak, which results in a stable and robust estimation. 

In Section lll-C, path tracking was introduced to guarantee 
that the artificial spots in Fig. S are marked at their true center, 
since the proposed estimation is only valid for signatures cal- 
culated in M. In the case of mammograms, surrounding bright 
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(issue might displace candidate pixels as well, in particular, 
for larger filler kernels. Therefore, path tracking is applied 
lo improve the correlation with the model and to accurately 
estimate the local contrast of candidates. 

In Fig. 9, signatures for several candidate pixels in the 
mammogram are given. For example, the first candidate has 
its local maximum at scale A uw > — 6 resulting in an estimated 
size of D' = 6. The contrast C is larger than the predefined 
threshold Cr(D') and the candidate is marked as a micro- 
calcification. The response of the second candidate has two 
local maxima for h mtx - 3 and A™, - M. The maximum 
on the coarser scale, however, is not taken into account, since 
image structures in the neighborhood dominate the signature. 
As in the first example, this candidate is also reported as a 
microcalcificaiioo. In the last example, we have a clear peak 
at h max = 3, but the estimated local contrast C is smaller than 
Ct(D') and the candidate is not marked. Fig. 8 also depicts 
the results for the microcalcification cluster already shown in 
Fig. 6. It should be noted that from the detection point of view, 
there is no need to segment individual microcalcifications from 
the background tissue. 

Following Strickland and Harm [13], the 2-D Gaussian 



... _ D*tf 
r ^ )=Ce (Wji' 

The position of the local maximum evaluates to 



Due to these equations, the cylindrical and the C: 
are equivalent if the response r^ w [h) is properly 




Fig. 9. Signature (+) of ihree candidate pixels, shown in Fig. Wo), com- 
ptred lo the signature of the model for the estimated site D' end * predefined 
contrast threshold CV(O') (solid* 



(13) 



is a more reasonable model for a microcalcification. Similar 
n of (10) the response r^ m {h) in M - (0,0) 




Fig. 10. Signature of the Gi 



Fig. 10, the graph of r^ m [h) is compared to the response of 
the cylindrical model. The algorithm could only take advantage 
of the slight difference between the curves near their local 
maximum if the comparison of C and C T {0') is extended 
to a small neighborhood around D'. The same arguments also 
hold for elongated ellipsoidal models, which might be more 
suited for the malignant type of microcalcifications. A visual 
inspection of the candidate signatures in the mammograms 



scaled. In disturbed by neighboring structures. Therefore, we use only the 
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local contrast information at position D\ which is sufficiently 
d by d 



£ Image Noise 

The noise characteristic of digitized mammograms is com- 
plex and strongly depends on the gray level itself. The noise 
level is considerably higher in bright regions oT the mammo- 
gram representing glandular tissue. As a result, the statistical 
variation of features extracted from different regions depends 
on the predominant type of tissue. For example, a given 
threshold Or might identify microcalcifications in dark re- 
gions, but will generate many FP detections in bright regions. 
If the threshold is increased the FP detections are reduced, 
but faint microcalcifications are lost in the darker areas of 
the mammogram. To guarantee the same detectability for 
microcalcifications in different image regions, the feature 
estimation algorithm must consider the gray level dependency 
of the noise. A solution to this problem is the use of adaptive 
methods which vary Or according to the local noise variance. 
In work on microcalcification. detection adaptive methods can 
be found in Chan ei al. [31] and Davies and Dance [4J. 

Karssemeijer and van Eming (1 1), [32) describe a different 
approach. They propose a noise equalization procedure applied 
prior to any subsequent image processing. The equalization can 
be understood as a nonlinear transformation L(y) of the gray 
values y to a scale on which the ooise level is constant. The 
image noise is measured by the local image contrast 



where Vi is Hre 6 rav ,evel at si,e »• ^» denotes neighboring sites 
around i, and N the total number or neighbors. The statistics 
or Cj, in particular the mean p and the standard deviation a, 
are used to represent the noise characteristic. 

Given a gray level band 6 = \jf - k,y + k] with k > 0, 
a suitable sample or sites x with /(z) e b is needed to get 
reliable estimates of the distribution. In mammography this is 
not a particular problem because of the large size of the image 
matrix. Karssemeijer divides the entire grayscale " 



calculated. We are using a different approach, which results in 
a larger number of bands and more detailed measurements. For 
an arbitrary gray value y. a corresponding band b is constructed 
by successively adding the neighboring levels y ± 1, y ±2, • • • 
until the band consists of a minimum number of pixel sites. 
For each band o, the histogram of the local contrast c,- is 
determined, followed by the calculation of the mean n{y) and 
the standard deviation a(y) of the bands. 

Experiments show that already about 1500 image sites are 
sufficient to yield stable estimates or the moments of the 
histogram. The normalized histograms resemble a Gaussian 
distribution with zero mean. Fig. II depicts the estimated 
standard deviation a(y) of the local contrast as a function 
of the gray value for the Nijmegen mammogram c!5c. 

Given constant noise level o„ the scale transformation 
L{y) can be determined from the standard deviation <r(y) by 
numerical integration. The transformation itself is represented 




value for ihe Nymegen m 
to 8 biu prim to the measun 
of a tingle band. 




(17) 



by a look-up table. We do not intend to describe the calculation 
of L(y) in more detail, since the Nijmegen database already 
contains an appropriate look-up table for each mammogram, 
which we use for the experiments in this paper. However, 
for the calculation of Ihe Laplacian feature noise, which is 
described in Section III-E2, we apply the noise estimation, 
as proposed above. The average noise level of the reseated 
Nijmegen mammograms is c P a 1.98. In Fig. 12, the Cans- 
formation L(y) for mammogram c 15c is shown. In the next 
section, we consider consequences of the consideration of 
Gaussian noise. 

J) Noise-Corrcatd Microcalcification model: Due to noise- 
equalization, mammograms have a constant noise level o>. 
After Laplacian convolution the noise still has a Gaussian 
characteristic, but the remaining noise level depends on the 
degree of smoothing and, hence, on the size of the filter kernel . 
The response of a candidate pixel should consider the noise 
level as a function of scale, otherwise, the local contrast is not 



Therefore, we complement the cylindrical microcalcification 
model by Gaussian noise with standard deviation o> and 
zero mean. Since convolution is a linear operation, signal 
and additive noise can be considered separately. The noise 
component yields the additional term o(k) in the signa- 
ture r cy i(ri) in (10) or the microcalcification model. For 
the calculation of <r(h), the change or Ihe gray value at 
by independent random variables -V(t') 
N(0,o r ). The random variables X[i) are 
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TABLE I 

Number op Canuumtes of Mammogram cI5c for 
Scales J> = 2, • ,7 in the Case op Uncorrected 
Laflacian Responses and in Tm Case of Noise Correction 



especially. Tor small values of D, the noise term significantly 
s the peak, which is reflected by 



Fig. 13. Signature of ibe cylindrical 
terra (tolid) and without nobe term (dashed). The nmu level m 2.0. The 
loci contrast end the size on (a) C a SO, D - J and (b) C ■ t.D = 2. 



convolved with the Laplacian at scale ft 

Z k {i) = (LoGa . A](t) = £ LoGhO)A'« + i) ( l» 

yielding new random variables Since Z h (i) is a linear 
combination of independent random variables the variance of 
Z h (i) evaluates to 

jet* 

= V«r(A'(,))£± 

(-9) 

and thus, the noise level after convolution at scale h is 

4»—'7sb (20) 

Addition of (10) and (20) yields the response tw^/i) of the 
noise-corrected cylindrical microcalcificauon model 



r.»i»(/.)=r CTI (/i) + a(/.) 



(21) 



The graph of TW^f/i) is shown in Fig. 13. 

Analytic differentiation of (21) to determine the position of 
the local maximum is difficult Plots of r^^h) make evident 
that the noise term o(h) does not significantly alter the position 
of the characteristic peak if parameters are in the range used 
by the detection scheme. The model response for large values 
of C and noise levels c r around 2.0 is much the same as for 
the uncorrected model. For models with low conirast C and. 



•«*.(D)=C + . 



' r J* D' 



(22) 



Without consideration of the noise term o(/»), the estimated 
contrast of a candidate pixel is too large, especially for faint 
microcalcificaiions with low contrast, resulting in numerous 
FP detections. Moreover, if mammograms with different noise 
level o T are considered, the sensitivity of the detection is 
biased by o T . Therefore, the signature of a candidate pixel 
is corrected by subtracting the noise term o(h). The corrected 
response is independent of image noise, especially of the noise 
level o r of a particular mammogram. 

A probably more severe problem is that the image noise 
also affects the tracking of candidate pixels. Because of 
too large responses on small scales along a path in the 
scale-space, the absolute maximum of the responses and, 
heme, 4he optimal, candidate pixel with best correlation to the 
model, is systematically shifted toward smaller scales. This 
is demonstrated in Table I. Also, the response along the path 
must be corrected by subtraction of the noise term given in 
(20). Here, the subtraction of 1/ft is sufficient. Constants and 
the image-dependent noise level o> have no effect on the 
search of the maximum, since only responses of the same 
candidate and image are compared to each other. 

2) Laplacian Feature Noise: The decision to report a can- 
didate pixel as a microcalcification only depends on the 
threshold Or and, hence, on a reliable estimate of the local 
contrast of the corresponding spot, which must be independent 
of the actual noise characteristic. We investigate the variation 
of ihe extracted image features, which is the result of the image 
noise in more detail. For small scales, the local contrast kernel 
used for the noise estimation and the Laplacian kernel are 
very similar. The contrast kernel is a high-pass filler and more 
or less an approximation of the Laplacian. Thus, it cannot 
be expected that the estimation of the image noise or of the 
Laplacian feature noise, respectively, using either the contrast 
or the Laplacian convolution, are very different. 

The detection algorithm, however, considers features only 
for candidate pixels which correspond to local maxima in the 
convolved mammogram. From this point of view, the variation 
of the Laplacian kernel should not be determined at all sites i 
in the mammogram, but only for the candidate sites of a given 
gray value band. Fig. 14 depicts the standard deviation and the 
mean of the Laplacian evaluated only at candidate pixels in 
the noise-equalized mammogram cISc. As was to be expected, 
the standard deviation is consiant, but the mean is not zero. It 
shows a remarkable dependency on the gray level. If all sites 
are considered in the calculation, then the number of darker 
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Rg. 14. Laplaciar. feature Militia for the nota-equaliicd mammogram 
die for teak h = 2. The feature it calculated only at candidate niea. 

and brighter pixel values or, co be more precise, ihe number 
and latitude of positive and negative differences, is on average 
the same. Therefore, the mean is zero. The Laplacian is only 
calculated for local maxima i. The neighboring pixel values 
are all smaller than the value al site »', which results in a shift 
of the prior probabilities and thus in a nonzero mean. 

The dependence of the mean on the gray level must be 
considered by the detection scheme as well. Otherwise, micro- 
calcifications are not detected with the same sensitivity since 
the comparisons C > Or[D) are biased by the nonconstant 
mean. The implementation of the adaptation is straightforward. 
For each mammogram and each scale, linear regression is 
applied to approximate the mean values by a straight line with 
slope oa. Let y be the gray level of a candidate pixel. Then, 
the actual response r'{h) of the candidate pixel is corrected 
according to the slope rtj, by 

r'{h,y) = r'(h)-a h y (23) 

The largest slope of the line found is <* 2 = 0.0074 (mam- 
mogram c02o), which results in a considerable change of the 
threshold of 1.9 for y = 255. The shift of the mean is not very 
pronounced for larger scales. We correct only scale « = 2 and 
h as 3. It should be noted that the calculation of the feature 
statistics needs particular attention, since the local maximum 
constraint significantly reduces the number of sites considered 
in each gray level band. An analysis of the histogram for each 
band makes evident that a Gaussian still yields a reasonable 
approximation. 

F. Implementation 

Prior to any image processing the mammograms are noise- 
equalized by the look-up tables contained in the Nijmegen 
database. Each mammogram has an individual noise level o>. 
The breast tissue area is automatically segmented to avoid 



detection artifacts caused by markers and the sharp image 
boundary near the chest side. To reduce the computational 
burden of the method, the size of microcalcifkauons to be 
detected is limited to' a maximal size of D as 7. For each 
size D as 2, ■ • • , 7, a contrast threshold Gr(D) is selected to 
control the detection sensitivity or the algorithm. The calcu- 
lation of the scale-space representation of a mammogram up 
to a maximal scale h = 12 is sufficient to search for the local 
maximum of the signatures. Furthermore, all convolutions ore 
restricted to the breast tissue area. On average, only about 39% 
of the imBge matrix has to be convolved with the Laplacian. 

1) Spot detection: 

a) find local maxima on scale n = 2; 

b) initialize for each finding a path and follow the local 
maxima in the scale-space up to the scale ft = 12; 

c) correct the response along the path by subtraction of 
1/ft; 

d) identify the absolute maximum of the response and 
report the corresponding pixel as candidate. 

2) Estimation of site and local contrast. 

a) determine the signature r'{h) of the candidate; 

b) correct the signature according to the noise term 
o{h) given in (20); 

c) correct the signature according to (23) to adapt to 
the shift of the mean for scale ft = 2 and ft - 3; 

d) identify the characteristic peak and report C and 
D*. 

3) Threshold decision: 

a) mark the candidate as microcalcificaiion if C is 
larger than Gr(D*). 

The average time needed to process a Nijmegen mam- 
mogram is about 15 s on a SCI Indigo workstation (R44O0 
processor, 200 MHz, 128-MByte RAM). This time is almost 
completely spent for the generation of the candidate pixels, 
which involves the convolution with the Laplacian kernels and 
the search for the local maxima. The corrections, the search of 
the peak in the signature, and the comparison with the contrast 
thresholds is within one second. The convolution of the entire 
mammography tissue is only necessary for scales ft = 2 and 
n = 3 to find the slopes o 3 and oj, respectively, and to 
identify the path entries. The feature statistics are determined 
by the procedure described in Section III-E. During a trace of 
a path in the scale-space, convolutions are only needed in a 
small neighborhood. Convolutions with large kernels are rare, 
since most paths are short. Such convolutions are implemented 
by a demand-driven calculation model which guarantees that 
each response is only determined once. The compulation of 
signatures is also optimized. It is sufficient to determine that 
the signature only up to the characteristic peak is identified. 

All parameters are tuned for mammograms with a 0.1 m 
pixel size. It turns out that a reasonable change of Ihe pa- 
rameters described above does not significantly change the 
detection results. Neither signatures larger than h — 12 nor 
mkrocalcifications larger than D = 7 improve the detection 
results. Also, the choice of larger neighborhoods for the search 
of the local maxima in the filtered mammogram, as well as for 
the path tracking, does not change the results. 
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IV. RESULTS 

The algorithm is executed for a fixed setting of thresholds 
Ct(D), TP and FP clusters arc counted within the mam- 
mograms of the database and the number of FP clusters per 
image and the relative number of TP clusters are determined 
according to the overlap rule. Since the threshold depends on 
the size D, a large number of different parameter settings must 
be tested to evaluate the performance of the algorithm. For 
example, if we use ten different thresholds for each of the 
sizes D = 2, • • • , 7 of interest, then the entire database must be 
evaluated 10° times. The entire search space can be reduced to 
a moderate number of tests without missing settings with good 
results. Therefore, a set of true microcalciflcations is compiled 
by visual inspection of several candidate pixels. Based on the 
histogram of the estimated contrast for each size, the search 
space is concentrated to a subset or 2374 different contrast 
settings. 

To demonstrate the performance of the basic method and the 



path tracking and the effect of image noise, we perform three 
FROC evaluations, summarized in Fig. 15. The plots reveal 
the results of all threshold settings used for the evaluation. 

The basic method combines steps la). Id), 2a), 2d),- and 
3a) of the algorithm. Since path tracking is not yet applied 
in the basic method, step la) is complemented by the local 
maxima for scales h - 3, ■ • • , 7, which introduces multiple 
candidate pixels for single spots. The corresponding evaluation 
in Fig, 15(a) shows a large number of FP detections. On 
average, the algorithm finds 1.5 FP clusters per image. 

In the next experiment, the detection is enhanced by path 
tracking used in step lb) of the algorithm. The results in 
Fig. 15(b) make it evident that the number of FP clusters per 
image significantly decreases. Compared to the basic scheme, 
several clusters are also missed, especially at large FP rates per 
image. Both observations can be explained by more accurate 
contrast estimates, since signatures are calculated at sites with 
optimal correlation to the microcajcification model because of 
path tracking in step lb). 

Finally, the last experiment also includes the remaining steps 
Ic), 2b), and 2c) which consider the effect of image noise on 
the signature of a candidate pixel. It can be shown that steps 
2b) and 2c) contribute the same amount to the increase of the 
results while noise-corrected path tracking introduced in step 
lc) only slightly improves the number of correctly identified 




0 1 2 3 4 S 



Fig. 15. (a) Evaluation of Iht bat* algorithm, (b) Complemented wllh paid 
of TP clintm per image) are shown for oil 2374 threshold settings with a TP 



Threshold selection and performance r 
same set of images usually introduces a bias toward the 
method. To effectively compare the FROC performance of 
the modifications of the basic method as well as the perfor- 
mance of different approaches, using the Nijmegen database 
in Section V, the selection of optimal threshold values b 
addressed by a different set of 25 training mammograms, 
consisting of a total number or 57 microcalcificaiion clusters. 
For given FP rates, settings with the best TP values are 
determined for the training images. If more than one setting is 
found for a particular FP rate, then the one with the smallest 
sum of thresholds on scales h = 2, h = 3, and h = 4 is chosen. 




These sellings are tested against the original database yielding 
the FROC curves shown in the Fig. 16. Although quantitative 
measures are not further assessed to effectively compare the 
detection results, the improvement of the results due to path 



the biased experiments depicted in Fig. IS. 
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Fig. 17. Unbiased comparison of FROC cv 
(+) uidthe MRF approach (A), tn iterate of Uplacian de 
(A of 25 training images is used to find optimal threshold 



A visual inspeclion of the FP clusters makes evident that, 
on average, all mammograms contribute the same number of 
FP clusters. It turns out that, especially for low threshold 
settings, locations on thin line structures of glandular tissue 
are regarded as FP clusters. The calculation of the signature 
for a corresponding model of a line shows a similar response 
as a cylindrical microcalcification model for an appropriate 
choice of parameters D and C. As a result, a candidate on a 
line might be regarded as a microcalcificaiion for small values 
of Or. An increase of Or might eliminate line artifacts in 
particular cases while, on the other hand, faint calcifications 
may be lost. 

V. COMPARISON 

Karssemeijer's MRF approach [UJ allows us to model 
prior knowledge of spatial relations between objects within an 
iterative segmentation process. As an advantage, additional in- 
formation on the distribution of individual findings is included 
in the detection scheme. Pixels likely to be microcalcifications 
are rewarded if, in some spatial context, other microcalcifica- 
tion pixels already occur. Moreover, a line feature calculated 
at each image site ensures that microcalcification and line 
structures cannot be mixed up in an immediate neighborhood 
at the same lime. The corresponding feature distributions are 
approximated by a multivariate Gaussian distribution, relying 
on a detailed segmentation of true microcalcifications provided 
by expert radiologists. The approach proposed in this paper 
marks individual microcalcifications. Line artifacts are only 
removed by the threshold criterion, introducing several FP 
detections, as mentioned in Section IV. 

Karssemeijer uses a different nonstandard representation of 
the FROC results, which is based on the mean TP fraction. 
The mean TP fraction is defined like the usual TP fraction, 
but die number of TP clusters .of an individual mammogram 
is weighted by 1/7V where N is the total number of true 
clusters in ihe mammogram. This guarantees that each image 
contributes the same amount in the FROC results, independent 
of the number of true clusters. The threshold settings are 
trained with the additional training mammograms to avoid a 
bias toward the Uplacian approach. As depicted in Fig. 17, 
the results of the Laplacian appear slightly preferable, but the 
MRF approach is able to operate also at FP rates smaller than 
O.S per image. 



VI. CONCLUSIONS AND DISCUSSION 

A method for the detection of individual microcalcifications 
based on the Laplacian scale-space representation of a mam- 
mogram is proposed. The initial identification or candidate 
pixels as well as the estimation of their size and local contrast 
uses only information provided by this representation. The sen- 
sitivity of the detection is controlled by thresholds depending 
on the estimated size of a candidate. The thresholds have a 
well-defined interpretation. A threshold value is considered as 
the minimal local contrast of a finding required as an indicator 
of a possible microcalcification. The training of the thresholds 
does not rely on the feature distribution or on additional 
estimation procedures. It is simply accomplished by a FROC 
evaluation of a set of training mammograms. The algorithm 
itself depends on very few intrinsic parameters, for example, 
the length of a signature or the size of the neighborhood 
for determination of local maxima. These parameters ore not 
crucial with respect to the detection performance and can be 
adapted to mammograms with smaller pixel sizes. This results 
in a very stable and robust approach. 

The FROC performance of the proposed method is compa- 
rable to the results of Karssemeijer's MRP approach, which 
achieves the best FROC results for the Nijmegen database. 
In particular, at 0.5 FP detections per image, the mean TP 
fraction is about 0.9. This is remarkable since only individual 
microcalcifications are addressed by the detection and the 
training stage depends only on additional mammograms with 
ground truth. 

In contrast to the MRF approach, the developed algorithm is 
not able to operate with less than 0.3 FP detections per image at 
present. This can be explained by the fact that artifacts, such as 
scratches and film emulsion errors, are not explicitly identified 
and are only partly removed by the overlap rule during the 
FROC evaluation. Another source of FP errors is line artifacts. 
An additional algorithm which recognizes findings on line 
structures may reduce the number of FP clusters [33]. The 
MRF approach certainly benefits by the additional line feature. 

The FROC representation, based on the mean TP fraction 
chosen by Karssemeijer, might be misleading when different 
methods are compared. For example, the mean TP value of 
a cluster in a mammogram with a total number of four true 
clusters is only 0.25, whereas in a mammogram with just one 
cluster theTPFis 1.0. In the first situation four clusters have to 
be marked, while in the other the detection of a single cluster 
is sufficient to yield identical values. Table II compares the 
detection results for a threshold setting with an FP rate of 1.0 
per image. The Laplacian detection misses only 12 clusters 
out of 105, compared to 17 clusters in the case of the MRF 
approach. This corresponds to a TP fraction of 0.89 and 0.84, 
respectively, but only to equal values of 0.92 in the case of a 
mean TP fraction representation of the FROC results. 

h should be noted, that several images of the Nijmegen data- 
base, such as cl2c and cl7c, contain numerous overlapping 
clusters. It is difficult to consistently apply a cluster detection 
criterion in these cases, in particular, when the number of 
computer-generated findings is large. The counting of TP and 
FT clustei ' 
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The method described here detects individual microcal- 
cification. Particular knowledge about the image domain, 
clinically suspicious microcalciflcations appear in clusters, is 
not used so Tar and leaves space for improvements to the 
approach. The application or a neighborhood interaction model 
for individual findings seems most promising. A probabilistic 
relaxation scheme [34J, a spatial scan statistic |35J, or an ad- 
ditional clustering transformation, as suggested by Nishikawa 
el at. [36], might therefore be used. 

The spatial resolution of the Nijmegen mammograms of 
0.1 mm pixel size may appear rather limited compared to 
the actual size of faint microcalcificaiions. Most work on 
raicrocalcifications, however, is evaluated with mammograms 
of 0.1 m pixel size. The dependence of detection accuracy 
on the pixel size and depth of the digitized mammograms is 
addressed in a study by Chan ttal. [37]. Their results indicate 
that detection accuracy Is essentially independent of a pixel 
depth from 12 bits to 9 bits, but significantly decreases as 
the pixel size increases. Robustness of detection algorithms 
against changes of the spatial resolution of images, however, 
might depend on the applied method as well. 
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